Overview

Dataset statistics

Number of variables: 20
Number of observations: 3333
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1020.1 KiB
Average record size in memory: 313.4 B
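
The average record size follows from the total size and the number of observations. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes (the function name is illustrative and not part of pandas-profiling):

```python
KIB = 1024  # bytes per kibibyte (assumption about the unit used in the report)


def average_record_size(total_kib: float, n_rows: int) -> float:
    """Average bytes per row, given the total in-memory size in KiB."""
    return total_kib * KIB / n_rows


# Figures copied from the dataset statistics above.
avg_bytes = average_record_size(1020.1, 3333)
print(f"{avg_bytes:.1f} B")  # 313.4 B
```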

Variable types

NUM: 15
BOOL: 3
CAT: 2

Reproduction

Analysis started: 2020-02-25 12:37:09.203863
Analysis finished: 2020-02-25 12:37:47.740912
Version: pandas-profiling v2.5.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

State has a high cardinality: 51 distinct values (High cardinality)
Total day charge is highly correlated with Total day minutes (High correlation)
Total day minutes is highly correlated with Total day charge (High correlation)
Total eve charge is highly correlated with Total eve minutes (High correlation)
Total eve minutes is highly correlated with Total eve charge (High correlation)
Total night charge is highly correlated with Total night minutes (High correlation)
Total night minutes is highly correlated with Total night charge (High correlation)
Total intl charge is highly correlated with Total intl minutes (High correlation)
Total intl minutes is highly correlated with Total intl charge (High correlation)
Number vmail messages has 2411 (72.3%) zeros (Zeros)
Customer service calls has 697 (20.9%) zeros (Zeros)
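
The high-correlation warnings pair each charge column with its minutes column. The sample rows at the end of this report suggest why: each charge looks like the minutes multiplied by a fixed per-minute rate. The sketch below checks this on four sample rows only; the rates it prints are inferred from those rows and are not stated anywhere in the report.

```python
# (minutes, charge) pairs copied from the first four sample rows of this report.
SAMPLE = {
    "day": [(265.1, 45.07), (161.6, 27.47), (243.4, 41.38), (299.4, 50.90)],
    "eve": [(197.4, 16.78), (195.5, 16.62), (121.2, 10.30), (61.9, 5.26)],
    "night": [(244.7, 11.01), (254.4, 11.45), (162.6, 7.32), (196.9, 8.86)],
    "intl": [(10.0, 2.70), (13.7, 3.70), (12.2, 3.29), (6.6, 1.78)],
}


def implied_rates(pairs):
    """Charge divided by minutes for each (minutes, charge) pair."""
    return [charge / minutes for minutes, charge in pairs]


mean_rates = {}
spreads = {}
for period, pairs in SAMPLE.items():
    rates = implied_rates(pairs)
    mean_rates[period] = sum(rates) / len(rates)
    spreads[period] = max(rates) - min(rates)
    print(f"{period:<6} rate ~ {mean_rates[period]:.3f}  spread {spreads[period]:.5f}")
```

If the ratio is effectively constant, one column of each pair carries no extra information, which is a common reason to drop either the charge or the minutes column before modelling.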

Variables

State
Categorical

HIGH CARDINALITY
Distinct count: 51
Unique (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 26.2 KiB

Common values

Value              Count  Frequency (%)
WV                 106    3.2%
MN                 84     2.5%
NY                 83     2.5%
AL                 80     2.4%
WI                 78     2.3%
OH                 78     2.3%
OR                 78     2.3%
VA                 77     2.3%
WY                 77     2.3%
CT                 74     2.2%
Other values (41)  2518   75.5%
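
The frequency tables in this report list the most frequent values and fold the remainder into an "Other values (n)" row. A minimal sketch of that aggregation using only the standard library; `common_values` is a hypothetical helper, not pandas-profiling's own code, and the input is just the ten states from the sample rows rather than the full column.

```python
from collections import Counter


def common_values(values, k=10):
    """Return (label, count, percent) rows for the k most frequent values,
    followed by an 'Other values (n)' row when more distinct values remain."""
    counts = Counter(values)
    total = sum(counts.values())
    top = counts.most_common(k)
    rows = [(str(value), count, 100 * count / total) for value, count in top]
    n_other = len(counts) - len(top)
    if n_other > 0:
        other = total - sum(count for _, count in top)
        rows.append((f"Other values ({n_other})", other, 100 * other / total))
    return rows


# States from the first ten sample rows of this report (toy input).
states = ["KS", "OH", "NJ", "OH", "OK", "AL", "MA", "MO", "LA", "WV"]
for label, count, pct in common_values(states, k=2):
    print(f"{label:<18}{count:>5}{pct:>7.1f}%")
```

The same bookkeeping explains the State table: the ten listed counts sum to 815, leaving 3333 - 815 = 2518 rows spread over the other 41 states.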
 

Length

Max length: 2
Mean length: 2
Min length: 2

Unicode category

Value             Count  Frequency (%)
Uppercase_Letter  24     100.0%

Unicode script

Value  Count  Frequency (%)
Latin  24     100.0%

Unicode block

Value  Count  Frequency (%)
ASCII  24     100.0%
 

Account length
Real number (ℝ≥0)

Distinct count: 212
Unique (%): 6.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 101.06480648064806
Minimum: 1
Maximum: 243
Zeros: 0
Zeros (%): 0.0%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 35
Q1: 74
Median: 101
Q3: 127
95-th percentile: 167
Maximum: 243
Range: 242
Interquartile range (IQR): 53

Descriptive statistics

Standard deviation: 39.82210593
Coefficient of variation (CV): 0.3940254508
Kurtosis: -0.1078359806
Mean: 101.0648065
Mean absolute deviation (MAD): 31.82144029
Skewness: 0.09660629423
Sum: 336849
Variance: 1585.800121

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 12.5 32.5 49.5 ... 149.5 166.5 185.5 211. 243. ], "bayesian blocks" binning strategy used)
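
Several of the figures above are simple functions of one another. The sketch below recomputes the range, IQR, coefficient of variation, variance and sum from the other Account length statistics. It also derives ten equal-width bin edges, on the assumption (not confirmed by the report) that the fixed-size histogram spans the minimum to the maximum.

```python
# Figures copied from the Account length summary in this report.
n = 3333
minimum, maximum = 1, 243
q1, q3 = 74, 127
mean = 101.06480648064806
std = 39.82210593

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range
cv = std / mean                  # Coefficient of variation
variance = std ** 2              # Variance is the squared standard deviation
total = mean * n                 # Sum is the mean times the number of observations

# Edges for 10 equal-width bins between minimum and maximum (assumed layout).
bins = 10
width = value_range / bins
edges = [minimum + i * width for i in range(bins + 1)]

print(value_range, iqr, round(cv, 4), round(variance, 2), round(total))
print([round(e, 1) for e in edges])
```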
Common values

Value               Count  Frequency (%)
105                 43     1.3%
87                  42     1.3%
93                  40     1.2%
101                 40     1.2%
90                  39     1.2%
86                  38     1.1%
95                  38     1.1%
116                 37     1.1%
100                 37     1.1%
112                 36     1.1%
Other values (202)  2943   88.3%

Minimum 5 values

Value  Count  Frequency (%)
1      8      0.2%
2      1      < 0.1%
3      5      0.2%
4      1      < 0.1%
5      1      < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
243    1      < 0.1%
232    1      < 0.1%
225    2      0.1%
224    2      0.1%
221    1      < 0.1%
 

Area code
Categorical

Distinct count: 3
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 26.2 KiB

Common values

Value  Count  Frequency (%)
415    1655   49.7%
510    840    25.2%
408    838    25.1%

Length

Max length: 3
Mean length: 3
Min length: 3

Unicode category

Value           Count  Frequency (%)
Decimal_Number  5      100.0%

Unicode script

Value   Count  Frequency (%)
Common  5      100.0%

Unicode block

Value  Count  Frequency (%)
ASCII  5      100.0%
 

International plan
Boolean

Distinct count: 2
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 26.2 KiB

Value  Count  Frequency (%)
No     3010   90.3%
Yes    323    9.7%

Voice mail plan
Boolean

Distinct count: 2
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 26.2 KiB

Value  Count  Frequency (%)
No     2411   72.3%
Yes    922    27.7%
 

Number vmail messages
Real number (ℝ≥0)

ZEROS
Distinct count: 46
Unique (%): 1.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.099009900990099
Minimum: 0
Maximum: 51
Zeros: 2411
Zeros (%): 72.3%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 20
95-th percentile: 36
Maximum: 51
Range: 51
Interquartile range (IQR): 20

Descriptive statistics

Standard deviation: 13.68836537
Coefficient of variation (CV): 1.690128243
Kurtosis: -0.05112853879
Mean: 8.099009901
Mean absolute deviation (MAD): 11.71977792
Skewness: 1.264823634
Sum: 26994
Variance: 187.3713466

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 2. 11.5 18.5 22.5 33.5 39.5 44.5 51. ], "bayesian blocks" binning strategy used)
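
With 72.3% zeros, the overall mean says little about customers who actually receive voice mail. The short sketch below separates the two groups using only the figures above. The zero count equals the 2411 "No" entries in the second Yes/No table earlier in the report, which presumably belongs to Voice mail plan; that link is an inference, not something the report states.

```python
# Figures copied from the Number vmail messages summary in this report.
n = 3333
zeros = 2411
total_messages = 26994

zero_share = zeros / n                   # fraction of rows with no messages
nonzero = n - zeros                      # rows with at least one message
mean_all = total_messages / n            # mean over all rows
mean_nonzero = total_messages / nonzero  # mean over rows with messages only

print(f"zeros: {100 * zero_share:.1f}%")
print(f"mean over all rows: {mean_all:.2f}")
print(f"mean over non-zero rows: {mean_nonzero:.2f}")
```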
Common values

Value              Count  Frequency (%)
0                  2411   72.3%
31                 60     1.8%
29                 53     1.6%
28                 51     1.5%
33                 46     1.4%
27                 44     1.3%
30                 44     1.3%
24                 42     1.3%
26                 41     1.2%
32                 41     1.2%
Other values (36)  500    15.0%

Minimum 5 values

Value  Count  Frequency (%)
0      2411   72.3%
4      1      < 0.1%
8      2      0.1%
9      2      0.1%
10     1      < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
51     1      < 0.1%
50     2      0.1%
49     1      < 0.1%
48     2      0.1%
47     3      0.1%
 

Total day minutes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 1667
Unique (%): 50.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 179.77509750975094
Minimum: 0.0
Maximum: 350.8
Zeros: 2
Zeros (%): 0.1%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 89.92
Q1: 143.7
Median: 179.4
Q3: 216.4
95-th percentile: 270.74
Maximum: 350.8
Range: 350.8
Interquartile range (IQR): 72.7

Descriptive statistics

Standard deviation: 54.4673892
Coefficient of variation (CV): 0.3029751615
Kurtosis: -0.01994037885
Mean: 179.7750975
Mean absolute deviation (MAD): 43.52345523
Skewness: -0.02907706714
Sum: 599190.4
Variance: 2966.696487

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 46.95 81.45 109.05 141.05 ... 246.55 274.8 295.7 327.3 350.8 ], "bayesian blocks" binning strategy used)

Common values

Value                Count  Frequency (%)
174.5                8      0.2%
159.5                8      0.2%
154                  8      0.2%
175.4                7      0.2%
162.3                7      0.2%
183.4                7      0.2%
198.4                6      0.2%
185                  6      0.2%
153.5                6      0.2%
155.2                6      0.2%
Other values (1657)  3264   97.9%

Minimum 5 values

Value  Count  Frequency (%)
0      2      0.1%
2.6    1      < 0.1%
7.8    1      < 0.1%
7.9    1      < 0.1%
12.5   1      < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
350.8  1      < 0.1%
346.8  1      < 0.1%
345.3  1      < 0.1%
337.4  1      < 0.1%
335.5  1      < 0.1%
 

Total day calls
Real number (ℝ≥0)

Distinct count: 119
Unique (%): 3.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100.43564356435644
Minimum: 0
Maximum: 165
Zeros: 2
Zeros (%): 0.1%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 67
Q1: 87
Median: 101
Q3: 114
95-th percentile: 133
Maximum: 165
Range: 165
Interquartile range (IQR): 27

Descriptive statistics

Standard deviation: 20.06908421
Coefficient of variation (CV): 0.1998203376
Kurtosis: 0.2431815246
Mean: 100.4356436
Mean absolute deviation (MAD): 15.94494301
Skewness: -0.111786639
Sum: 334752
Variance: 402.7681409

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 38. 53.5 64.5 76.5 ... 123.5 134.5 141.5 151.5 165. ], "bayesian blocks" binning strategy used)

Common values

Value               Count  Frequency (%)
102                 78     2.3%
105                 75     2.3%
107                 69     2.1%
95                  69     2.1%
104                 68     2.0%
108                 67     2.0%
97                  67     2.0%
110                 66     2.0%
106                 66     2.0%
88                  66     2.0%
Other values (109)  2642   79.3%

Minimum 5 values

Value  Count  Frequency (%)
0      2      0.1%
30     1      < 0.1%
35     1      < 0.1%
36     1      < 0.1%
40     2      0.1%

Maximum 5 values

Value  Count  Frequency (%)
165    1      < 0.1%
163    1      < 0.1%
160    1      < 0.1%
158    3      0.1%
157    1      < 0.1%
 

Total day charge
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 1667
Unique (%): 50.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 30.562307230723075
Minimum: 0.0
Maximum: 59.64
Zeros: 2
Zeros (%): 0.1%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 15.288
Q1: 24.43
Median: 30.5
Q3: 36.79
95-th percentile: 46.028
Maximum: 59.64
Range: 59.64
Interquartile range (IQR): 12.36

Descriptive statistics

Standard deviation: 9.259434554
Coefficient of variation (CV): 0.3029690947
Kurtosis: -0.01981178724
Mean: 30.56230723
Mean absolute deviation (MAD): 7.39891389
Skewness: -0.02908326834
Sum: 101864.17
Variance: 85.73712826

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 7.985 13.845 18.54 23.98 ... 41.915 46.715 50.27 55.645 59.64 ], "bayesian blocks" binning strategy used)

Common values

Value                Count  Frequency (%)
27.12                8      0.2%
26.18                8      0.2%
29.67                8      0.2%
31.18                7      0.2%
27.59                7      0.2%
29.82                7      0.2%
24.19                6      0.2%
28.66                6      0.2%
36.72                6      0.2%
26.1                 6      0.2%
Other values (1657)  3264   97.9%

Minimum 5 values

Value  Count  Frequency (%)
0      2      0.1%
0.44   1      < 0.1%
1.33   1      < 0.1%
1.34   1      < 0.1%
2.13   1      < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
59.64  1      < 0.1%
58.96  1      < 0.1%
58.7   1      < 0.1%
57.36  1      < 0.1%
57.04  1      < 0.1%
 

Total eve minutes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 1611
Unique (%): 48.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 200.98034803480348
Minimum: 0.0
Maximum: 363.7
Zeros: 1
Zeros (%): < 0.1%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 118.8
Q1: 166.6
Median: 201.4
Q3: 235.3
95-th percentile: 284.3
Maximum: 363.7
Range: 363.7
Interquartile range (IQR): 68.7

Descriptive statistics

Standard deviation: 50.71384443
Coefficient of variation (CV): 0.2523323545
Kurtosis: 0.02562975284
Mean: 200.980348
Mean absolute deviation (MAD): 40.46924431
Skewness: -0.02387745608
Sum: 669867.5
Variance: 2571.894016

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 57.3 87.2 112.4 128.8 ... 257.6 276.9 294.15 330.2 363.7 ], "bayesian blocks" binning strategy used)

Common values

Value                Count  Frequency (%)
169.9                9      0.3%
230.9                7      0.2%
209.4                7      0.2%
201                  7      0.2%
220.6                7      0.2%
180.5                7      0.2%
161.7                7      0.2%
167.2                7      0.2%
195.5                7      0.2%
194                  6      0.2%
Other values (1601)  3262   97.9%

Minimum 5 values

Value  Count  Frequency (%)
0      1      < 0.1%
31.2   1      < 0.1%
42.2   1      < 0.1%
42.5   1      < 0.1%
43.9   1      < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
363.7  1      < 0.1%
361.8  1      < 0.1%
354.2  1      < 0.1%
351.6  1      < 0.1%
350.9  1      < 0.1%
 

Total eve calls
Real number (ℝ≥0)

Distinct count: 123
Unique (%): 3.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100.11431143114311
Minimum: 0
Maximum: 170
Zeros: 1
Zeros (%): < 0.1%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 67
Q1: 87
Median: 100
Q3: 114
95-th percentile: 133
Maximum: 170
Range: 170
Interquartile range (IQR): 27

Descriptive statistics

Standard deviation: 19.92262529
Coefficient of variation (CV): 0.1989987746
Kurtosis: 0.206156468
Mean: 100.1143114
Mean absolute deviation (MAD): 15.86033185
Skewness: -0.05556313904
Sum: 333681
Variance: 396.9109986

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 42.5 55.5 62.5 73.5 ... 128.5 138.5 144.5 155.5 170. ], "bayesian blocks" binning strategy used)

Common values

Value               Count  Frequency (%)
105                 80     2.4%
94                  79     2.4%
108                 71     2.1%
97                  70     2.1%
102                 70     2.1%
88                  69     2.1%
101                 68     2.0%
109                 67     2.0%
98                  66     2.0%
111                 65     2.0%
Other values (113)  2628   78.8%

Minimum 5 values

Value  Count  Frequency (%)
0      1      < 0.1%
12     1      < 0.1%
36     1      < 0.1%
37     1      < 0.1%
42     1      < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
170    1      < 0.1%
168    1      < 0.1%
164    1      < 0.1%
159    1      < 0.1%
157    1      < 0.1%
 

Total eve charge
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 1440
Unique (%): 43.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17.083540354035403
Minimum: 0.0
Maximum: 30.91
Zeros: 1
Zeros (%): < 0.1%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 10.1
Q1: 14.16
Median: 17.12
Q3: 20
95-th percentile: 24.17
Maximum: 30.91
Range: 30.91
Interquartile range (IQR): 5.84

Descriptive statistics

Standard deviation: 4.310667643
Coefficient of variation (CV): 0.2523287067
Kurtosis: 0.02548740481
Mean: 17.08354035
Mean absolute deviation (MAD): 3.439937441
Skewness: -0.02385798901
Sum: 56939.44
Variance: 18.58185553

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 4.87 7.415 9.59 10.95 ... 21.895 23.54 24.9 28.065 30.91 ], "bayesian blocks" binning strategy used)

Common values

Value                Count  Frequency (%)
14.25                11     0.3%
16.12                11     0.3%
15.9                 10     0.3%
18.62                9      0.3%
14.44                9      0.3%
17.09                9      0.3%
17.99                9      0.3%
18.79                8      0.2%
16.63                8      0.2%
17.43                8      0.2%
Other values (1430)  3241   97.2%

Minimum 5 values

Value  Count  Frequency (%)
0      1      < 0.1%
2.65   1      < 0.1%
3.59   1      < 0.1%
3.61   1      < 0.1%
3.73   1      < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
30.91  1      < 0.1%
30.75  1      < 0.1%
30.11  1      < 0.1%
29.89  1      < 0.1%
29.83  1      < 0.1%
 

Total night minutes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 1591
Unique (%): 47.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 200.87203720372037
Minimum: 23.2
Maximum: 395.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 23.2
5-th percentile: 118.18
Q1: 167
Median: 201.2
Q3: 235.3
95-th percentile: 282.84
Maximum: 395
Range: 371.8
Interquartile range (IQR): 68.3

Descriptive statistics

Standard deviation: 50.57384701
Coefficient of variation (CV): 0.2517714646
Kurtosis: 0.08581607799
Mean: 200.8720372
Mean absolute deviation (MAD): 40.41038687
Skewness: 0.008921291065
Sum: 669506.5
Variance: 2557.714002

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 23.2 70.85 101.45 126.65 146.35 ... 272.05 290.5 313.75 334.1 395. ], "bayesian blocks" binning strategy used)

Common values

Value                Count  Frequency (%)
210                  8      0.2%
214.6                8      0.2%
197.4                8      0.2%
191.4                8      0.2%
188.2                8      0.2%
231.5                7      0.2%
221.6                7      0.2%
193.6                7      0.2%
214.7                7      0.2%
194.3                7      0.2%
Other values (1581)  3258   97.7%

Minimum 5 values

Value  Count  Frequency (%)
23.2   1      < 0.1%
43.7   1      < 0.1%
45     1      < 0.1%
47.4   1      < 0.1%
50.1   2      0.1%

Maximum 5 values

Value  Count  Frequency (%)
395    1      < 0.1%
381.9  1      < 0.1%
377.5  1      < 0.1%
367.7  1      < 0.1%
364.9  1      < 0.1%
 

Total night calls
Real number (ℝ≥0)

Distinct count: 120
Unique (%): 3.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100.10771077107711
Minimum: 33
Maximum: 175
Zeros: 0
Zeros (%): 0.0%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 33
5-th percentile: 68
Q1: 87
Median: 100
Q3: 113
95-th percentile: 132
Maximum: 175
Range: 142
Interquartile range (IQR): 26

Descriptive statistics

Standard deviation: 19.56860935
Coefficient of variation (CV): 0.1954755452
Kurtosis: -0.07201957894
Mean: 100.1077108
Mean absolute deviation (MAD): 15.69034149
Skewness: 0.03249957015
Sum: 333659
Variance: 382.9304717

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 33. 52.5 59.5 70.5 81.5 ... 125.5 131.5 140.5 157.5 175. ], "bayesian blocks" binning strategy used)

Common values

Value               Count  Frequency (%)
105                 84     2.5%
104                 78     2.3%
91                  76     2.3%
102                 72     2.2%
100                 69     2.1%
106                 69     2.1%
98                  67     2.0%
94                  66     2.0%
103                 65     2.0%
108                 64     1.9%
Other values (110)  2623   78.7%

Minimum 5 values

Value  Count  Frequency (%)
33     1      < 0.1%
36     1      < 0.1%
38     1      < 0.1%
42     2      0.1%
44     1      < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
175    1      < 0.1%
166    1      < 0.1%
164    1      < 0.1%
158    1      < 0.1%
157    2      0.1%
 

Total night charge
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 933
Unique (%): 28.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.03932493249325
Minimum: 1.04
Maximum: 17.77
Zeros: 0
Zeros (%): 0.0%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 1.04
5-th percentile: 5.316
Q1: 7.52
Median: 9.05
Q3: 10.59
95-th percentile: 12.73
Maximum: 17.77
Range: 16.73
Interquartile range (IQR): 3.07

Descriptive statistics

Standard deviation: 2.275872838
Coefficient of variation (CV): 0.2517746463
Kurtosis: 0.08566317984
Mean: 9.039324932
Mean absolute deviation (MAD): 1.818554703
Skewness: 0.008886236769
Sum: 30128.07
Variance: 5.179597173

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1.04 3.19 4.565 5.705 6.585 ... 12.155 13.075 14.115 15.035 17.77 ], "bayesian blocks" binning strategy used)

Common values

Value               Count  Frequency (%)
9.66                15     0.5%
9.45                15     0.5%
8.88                14     0.4%
8.47                14     0.4%
7.69                13     0.4%
8.64                12     0.4%
9.14                11     0.3%
10.35               11     0.3%
10.8                11     0.3%
9.32                11     0.3%
Other values (923)  3206   96.2%

Minimum 5 values

Value  Count  Frequency (%)
1.04   1      < 0.1%
1.97   1      < 0.1%
2.03   1      < 0.1%
2.13   1      < 0.1%
2.25   2      0.1%

Maximum 5 values

Value  Count  Frequency (%)
17.77  1      < 0.1%
17.19  1      < 0.1%
16.99  1      < 0.1%
16.55  1      < 0.1%
16.42  1      < 0.1%
 

Total intl minutes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 162
Unique (%): 4.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.237293729372938
Minimum: 0.0
Maximum: 20.0
Zeros: 18
Zeros (%): 0.5%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 5.7
Q1: 8.5
Median: 10.3
Q3: 12.1
95-th percentile: 14.7
Maximum: 20
Range: 20
Interquartile range (IQR): 3.6

Descriptive statistics

Standard deviation: 2.791839548
Coefficient of variation (CV): 0.2727126546
Kurtosis: 0.6091847602
Mean: 10.23729373
Mean absolute deviation (MAD): 2.184712135
Skewness: -0.2451359395
Sum: 34120.9
Variance: 7.794368064

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.55 3.45 4.85 5.75 ... 14.75 15.65 17.05 18.35 20. ], "bayesian blocks" binning strategy used)

Common values

Value               Count  Frequency (%)
10                  62     1.9%
11.3                59     1.8%
9.8                 56     1.7%
10.9                56     1.7%
10.1                53     1.6%
10.2                53     1.6%
10.6                53     1.6%
11.1                52     1.6%
11                  52     1.6%
9.7                 51     1.5%
Other values (152)  2786   83.6%

Minimum 5 values

Value  Count  Frequency (%)
0      18     0.5%
1.1    1      < 0.1%
1.3    1      < 0.1%
2      2      0.1%
2.1    2      0.1%

Maximum 5 values

Value  Count  Frequency (%)
20     1      < 0.1%
18.9   1      < 0.1%
18.4   1      < 0.1%
18.3   1      < 0.1%
18.2   2      0.1%
 

Total intl calls
Real number (ℝ≥0)

Distinct count: 21
Unique (%): 0.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.4794479447944795
Minimum: 0
Maximum: 20
Zeros: 18
Zeros (%): 0.5%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 3
Median: 4
Q3: 6
95-th percentile: 9
Maximum: 20
Range: 20
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 2.461214271
Coefficient of variation (CV): 0.5494458917
Kurtosis: 3.083588982
Mean: 4.479447945
Mean absolute deviation (MAD): 1.88109288
Skewness: 1.321478166
Sum: 14930
Variance: 6.057575686

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 4.5 ... 7.5 9.5 11.5 15.5 20. ], "bayesian blocks" binning strategy used)

Common values

Value              Count  Frequency (%)
3                  668    20.0%
4                  619    18.6%
2                  489    14.7%
5                  472    14.2%
6                  336    10.1%
7                  218    6.5%
1                  160    4.8%
8                  116    3.5%
9                  109    3.3%
10                 50     1.5%
Other values (11)  96     2.9%

Minimum 5 values

Value  Count  Frequency (%)
0      18     0.5%
1      160    4.8%
2      489    14.7%
3      668    20.0%
4      619    18.6%

Maximum 5 values

Value  Count  Frequency (%)
20     1      < 0.1%
19     1      < 0.1%
18     3      0.1%
17     1      < 0.1%
16     2      0.1%
 

Total intl charge
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 162
Unique (%): 4.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.7645814581458144
Minimum: 0.0
Maximum: 5.4
Zeros: 18
Zeros (%): 0.5%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1.54
Q1: 2.3
Median: 2.78
Q3: 3.27
95-th percentile: 3.97
Maximum: 5.4
Range: 5.4
Interquartile range (IQR): 0.97

Descriptive statistics

Standard deviation: 0.7537726127
Coefficient of variation (CV): 0.2726534284
Kurtosis: 0.6096104298
Mean: 2.764581458
Mean absolute deviation (MAD): 0.589880385
Skewness: -0.2452865083
Sum: 9214.35
Variance: 0.5681731516

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.15 0.935 1.31 1.555 ... 4.01 4.225 4.605 4.955 5.4 ], "bayesian blocks" binning strategy used)

Common values

Value               Count  Frequency (%)
2.7                 62     1.9%
3.05                59     1.8%
2.65                56     1.7%
2.94                56     1.7%
2.73                53     1.6%
2.86                53     1.6%
2.75                53     1.6%
3                   52     1.6%
2.97                52     1.6%
2.62                51     1.5%
Other values (152)  2786   83.6%

Minimum 5 values

Value  Count  Frequency (%)
0      18     0.5%
0.3    1      < 0.1%
0.35   1      < 0.1%
0.54   2      0.1%
0.57   2      0.1%

Maximum 5 values

Value  Count  Frequency (%)
5.4    1      < 0.1%
5.1    1      < 0.1%
4.97   1      < 0.1%
4.94   1      < 0.1%
4.91   2      0.1%
 

Customer service calls
Real number (ℝ≥0)

ZEROS
Distinct count: 10
Unique (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.5628562856285628
Minimum: 0
Maximum: 9
Zeros: 697
Zeros (%): 20.9%
Memory size: 26.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
Median: 1
Q3: 2
95-th percentile: 4
Maximum: 9
Range: 9
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.315491045
Coefficient of variation (CV): 0.8417223368
Kurtosis: 1.730913655
Mean: 1.562856286
Mean absolute deviation (MAD): 1.052531716
Skewness: 1.091359482
Sum: 5209
Variance: 1.730516689

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 4.5 5.5 6.5 9. ], "bayesian blocks" binning strategy used)
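
Customer service calls has only ten distinct values, and the report lists all of them in its frequency table, so the descriptive statistics can be recomputed exactly. The sketch below rebuilds the column from that table and computes two candidate readings of the MAD figure (1.052531716): the mean absolute deviation about the mean, and the median absolute deviation about the median. Only the former reproduces the reported value. The variance matches when computed with an n - 1 denominator.

```python
import statistics

# Full frequency table for Customer service calls, copied from this report.
freq = {0: 697, 1: 1181, 2: 759, 3: 429, 4: 166, 5: 66, 6: 22, 7: 9, 8: 2, 9: 2}

# Expand the table back into one value per observation.
values = [value for value, count in freq.items() for _ in range(count)]
n = len(values)
mean = sum(values) / n
median = statistics.median(values)

mean_abs_dev = sum(abs(v - mean) for v in values) / n
median_abs_dev = statistics.median(abs(v - median) for v in values)
sample_variance = statistics.variance(values)  # n - 1 denominator

print(n, sum(values))                           # observations and Sum
print(round(mean_abs_dev, 9), median_abs_dev)   # two readings of "MAD"
print(round(sample_variance, 9))                # Variance
```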
Common values

Value  Count  Frequency (%)
1      1181   35.4%
2      759    22.8%
0      697    20.9%
3      429    12.9%
4      166    5.0%
5      66     2.0%
6      22     0.7%
7      9      0.3%
9      2      0.1%
8      2      0.1%

Minimum 5 values

Value  Count  Frequency (%)
0      697    20.9%
1      1181   35.4%
2      759    22.8%
3      429    12.9%
4      166    5.0%

Maximum 5 values

Value  Count  Frequency (%)
9      2      0.1%
8      2      0.1%
7      9      0.3%
6      22     0.7%
5      66     2.0%
 

Churn
Boolean

Distinct count: 2
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 26.2 KiB

Value  Count  Frequency (%)
0      2850   85.5%
1      483    14.5%
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, which implies that for a linear function the steepness of the line, that is, its angle to the x-axis, does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
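
The calculation just described can be sketched in plain Python. The data are the Total day minutes and Total day charge values from the first five sample rows of this report; `pearson_r` is an illustrative helper, not the routine pandas-profiling itself calls.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    return cov / (std_x * std_y)


# Total day minutes and Total day charge from the first five sample rows.
minutes = [265.1, 161.6, 243.4, 299.4, 166.7]
charge = [45.07, 27.47, 41.38, 50.90, 28.34]

r = pearson_r(minutes, charge)
# Shifting and positively rescaling one variable leaves r unchanged.
r_rescaled = pearson_r([2 * m + 5 for m in minutes], charge)
print(round(r, 6), round(r_rescaled, 6))
```

On these five rows r is very close to 1, consistent with the high-correlation warning for this pair of columns.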

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
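
A minimal sketch of this rank-based calculation, assuming tied values receive the average of the ranks they span (a common convention, though the report does not say which one it uses). All function names are illustrative.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(x, y):
    """Pearson's r applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


# A monotonic but nonlinear relationship: rho is 1 while r stays below 1.
x = [1, 2, 3, 4, 5]
y = [v ** 2 for v in x]
print(round(spearman_rho(x, y), 6), round(pearson_r(x, y), 4))
```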

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
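
The pair-counting definition above translates directly into code. This sketch implements exactly that formula, often called tau-a. Many libraries report a tie-corrected variant (tau-b) instead, so values computed this way may differ from a library's output when the data contain ties. The function name is illustrative.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
        # Pairs tied in x or y count as neither concordant nor discordant.
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs


print(kendall_tau_a([1, 2, 3, 4], [1, 2, 3, 4]))  # 1.0
print(kendall_tau_a([1, 2, 3, 4], [4, 3, 2, 1]))  # -1.0
print(round(kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4]), 4))
```

The double loop over pairs is quadratic in the number of observations, which is fine for illustration but slow for a column of 3333 rows compared with the sort-based algorithms libraries use.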

Missing values

Sample

First rows

Index, State, Account length, Area code, International plan, Voice mail plan, Number vmail messages, Total day minutes, Total day calls, Total day charge, Total eve minutes, Total eve calls, Total eve charge, Total night minutes, Total night calls, Total night charge, Total intl minutes, Total intl calls, Total intl charge, Customer service calls, Churn
0, KS, 128, 415, No, Yes, 25, 265.1, 110, 45.07, 197.4, 99, 16.78, 244.7, 91, 11.01, 10.0, 3, 2.70, 1, 0
1, OH, 107, 415, No, Yes, 26, 161.6, 123, 27.47, 195.5, 103, 16.62, 254.4, 103, 11.45, 13.7, 3, 3.70, 1, 0
2, NJ, 137, 415, No, No, 0, 243.4, 114, 41.38, 121.2, 110, 10.30, 162.6, 104, 7.32, 12.2, 5, 3.29, 0, 0
3, OH, 84, 408, Yes, No, 0, 299.4, 71, 50.90, 61.9, 88, 5.26, 196.9, 89, 8.86, 6.6, 7, 1.78, 2, 0
4, OK, 75, 415, Yes, No, 0, 166.7, 113, 28.34, 148.3, 122, 12.61, 186.9, 121, 8.41, 10.1, 3, 2.73, 3, 0
5, AL, 118, 510, Yes, No, 0, 223.4, 98, 37.98, 220.6, 101, 18.75, 203.9, 118, 9.18, 6.3, 6, 1.70, 0, 0
6, MA, 121, 510, No, Yes, 24, 218.2, 88, 37.09, 348.5, 108, 29.62, 212.6, 118, 9.57, 7.5, 7, 2.03, 3, 0
7, MO, 147, 415, Yes, No, 0, 157.0, 79, 26.69, 103.1, 94, 8.76, 211.8, 96, 9.53, 7.1, 6, 1.92, 0, 0
8, LA, 117, 408, No, No, 0, 184.5, 97, 31.37, 351.6, 80, 29.89, 215.8, 90, 9.71, 8.7, 4, 2.35, 1, 0
9, WV, 141, 415, Yes, Yes, 37, 258.6, 84, 43.96, 222.0, 111, 18.87, 326.4, 97, 14.69, 11.2, 5, 3.02, 0, 0

Last rows

Index, State, Account length, Area code, International plan, Voice mail plan, Number vmail messages, Total day minutes, Total day calls, Total day charge, Total eve minutes, Total eve calls, Total eve charge, Total night minutes, Total night calls, Total night charge, Total intl minutes, Total intl calls, Total intl charge, Customer service calls, Churn
3323, IN, 117, 415, No, No, 0, 118.4, 126, 20.13, 249.3, 97, 21.19, 227.0, 56, 10.22, 13.6, 3, 3.67, 5, 1
3324, WV, 159, 415, No, No, 0, 169.8, 114, 28.87, 197.7, 105, 16.80, 193.7, 82, 8.72, 11.6, 4, 3.13, 1, 0
3325, OH, 78, 408, No, No, 0, 193.4, 99, 32.88, 116.9, 88, 9.94, 243.3, 109, 10.95, 9.3, 4, 2.51, 2, 0
3326, OH, 96, 415, No, No, 0, 106.6, 128, 18.12, 284.8, 87, 24.21, 178.9, 92, 8.05, 14.9, 7, 4.02, 1, 0
3327, SC, 79, 415, No, No, 0, 134.7, 98, 22.90, 189.7, 68, 16.12, 221.4, 128, 9.96, 11.8, 5, 3.19, 2, 0
3328, AZ, 192, 415, No, Yes, 36, 156.2, 77, 26.55, 215.5, 126, 18.32, 279.1, 83, 12.56, 9.9, 6, 2.67, 2, 0
3329, WV, 68, 415, No, No, 0, 231.1, 57, 39.29, 153.4, 55, 13.04, 191.3, 123, 8.61, 9.6, 4, 2.59, 3, 0
3330, RI, 28, 510, No, No, 0, 180.8, 109, 30.74, 288.8, 58, 24.55, 191.9, 91, 8.64, 14.1, 6, 3.81, 2, 0
3331, CT, 184, 510, Yes, No, 0, 213.8, 105, 36.35, 159.6, 84, 13.57, 139.2, 137, 6.26, 5.0, 10, 1.35, 2, 0
3332, TN, 74, 415, No, Yes, 25, 234.4, 113, 39.85, 265.9, 82, 22.60, 241.4, 77, 10.86, 13.7, 4, 3.70, 0, 0